Unsupervised dimensionality reduction: the challenges of big data visualisation
نویسندگان
چکیده
Dimensionality reduction is an unsupervised task that allows high-dimensional data to be processed or visualised in lower-dimensional spaces. This tutorial reviews the basic principles of dimensionality reduction and discusses some of the approaches that were published over the past years from the perspective of their application to big data. The tutorial ends with a short review of papers about dimensionality reduction in these proceedings, as well as some perspectives for the near future.
منابع مشابه
Analysis of unsupervised dimensionality reduction techniques
Domains such as text, images etc contain large amounts of redundancies and ambiguities among the attributes which result in considerable noise effects (i.e. the data is high dimension). Retrieving the data from high dimensional datasets is a big challenge. Dimensionality reduction techniques have been a successful avenue for automatically extracting the latent concepts by removing the noise and...
متن کاملMassively Parallel Unsupervised Feature Selection on Spark
High dimensional data sets pose important challenges such as the curse of dimensionality and increased computational costs. Dimensionality reduction is therefore a crucial step for most data mining applications. Feature selection techniques allow us to achieve said reduction. However, it is nowadays common to deal with huge data sets, and most existing feature selection algorithms are designed ...
متن کاملFeature dimension reduction for microarray data analysis using locally linear embedding
Cancer classification is one major application of microarray data analysis. Due to the ultra high dimensionality nature of microarray data, data dimension reduction has drawn special attention for such type of data analysis. The currently available data dimension reduction methods are either supervised, where data need to be labeled, or computational complex. In this paper, we proposed to use a...
متن کاملStability of Dimensionality Reduction Methods Applied on Artificial Hyperspectral Images
Dimensionality reduction is a big challenge in many areas. In this research we address the problem of high-dimensional hyperspectral images in which we are aiming to preserve its information quality. This paper introduces a study stability of the non parametric and unsupervised methods of projection and of bands selection used in dimensionality reduction of different noise levels determined wit...
متن کاملDimensionality reduction for financial data visualization
Various data mining methods are used for examining large financial data sets to uncover hidden and useful information. Ability to access big data sources raises new challenges related with capabilities to handle such enormous amounts of data. This research focuses on big financial data visualization that is based on dimensionality reduction methods. We use data set that contains financial ratio...
متن کامل